Predictive accuracy of risk factors and markers: a simulation study of the effect of novel markers on different performance measures for logistic regression models
نویسندگان
چکیده
The change in c-statistic is frequently used to summarize the change in predictive accuracy when a novel risk factor is added to an existing logistic regression model. We explored the relationship between the absolute change in the c-statistic, Brier score, generalized R(2) , and the discrimination slope when a risk factor was added to an existing model in an extensive set of Monte Carlo simulations. The increase in model accuracy due to the inclusion of a novel marker was proportional to both the prevalence of the marker and to the odds ratio relating the marker to the outcome but inversely proportional to the accuracy of the logistic regression model with the marker omitted. We observed greater improvements in model accuracy when the novel risk factor or marker was uncorrelated with the existing predictor variable compared with when the risk factor has a positive correlation with the existing predictor variable. We illustrated these findings by using a study on mortality prediction in patients hospitalized with heart failure. In conclusion, the increase in predictive accuracy by adding a marker should be considered in the context of the accuracy of the initial model.
منابع مشابه
Accuracy of obesity indices alone or in combination for prediction of diabetes: A novel risk score by linear combination of general and abdominal measures of obesity
Background: The predictive power of obesity measures varies according to the presence of coexistent measures. The present study aimed to determine the predictive power of combinations of obesity measures for diabetes by calculation of a linear risk score. Methods: Data from a population-based cross-sectional study of 994 representative samples of Iranian adults in Babol, Iran were analyzed. Me...
متن کاملFactors Influencing Drug Injection History among Prisoners: A Comparison between Classification and Regression Trees and Logistic Regression Analysis
Background: Due to the importance of medical studies, researchers of this field should be familiar with various types of statistical analyses to select the most appropriate method based on the characteristics of their data sets. Classification and regression trees (CARTs) can be as complementary to regression models. We compared the performance of a logistic regression model and a CART in predi...
متن کاملHybrid Method of Logistic Regression and Data Envelopment Analysis for Event Prediction: A Case Study (Stroke Disease)
Abstract Predictive analytics is an area of statistics that deals with extracting information from data and using it to predict trends and behavior patterns. Many mathematical modeling has been developed and used for prediction, and in some cases, they have been found to be very strong and reliable. This paper studies different mathematical and statistical approaches for events prediction. The ...
متن کاملPredictive factors for general health status in Iranian high school students an unvariate and multivariate logistic regression analysis
Introduction: Adolescents are the important portions of the Iranian population. Adolescents health status has a critical role in their learning ability and their performance. The present study aimed to determine the predictive factors for general health status in Iranian high school students in 2014. Materials and methods: In a cross-sectional study evaluated the predictive factors for general...
متن کاملارائه مدلی جهت پیش بینی بیماری دیابت با استفاده از شبکه عصبی
Introduction: Meta-heuristic and combined algorithms have a great capability in modelling medical decision making. This study used neural networks in order to predict Type 2 Diabetes (T2D) among high risk individuals. Methods: This study was an applied research. Data from 545 individuals (diabetic and non-diabetic), in Diabetes Clinic of Hamedan University of Medical Sciences, we...
متن کامل